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Abstract 

We discuss the behavior of two magnitudes, physical complexity and mutual information 
function of the outcome of a model of heterogeneous, inductive rational agents inspired in 
the El Farol Bar problem and the Minority Game. The first is a measure rooted in 
Kolmogorov-Chaitin theory and the second one a measure related with Shannon's 
information entropy. We make extensive computer simulations , as result of which, we 
propose an ansatz for physical complexity of the type C(l) = l a and establish the 
dependence of exponent a from the parameters of the model. We discuss the accuracy of 
our results and the relationship with the behavior of mutual information function as a 
measure of time correlations of agents choice. 

PACS numbers: 05.40.-a, 05.65.+b, 89.90.+n, 02.50.-r 

Key words: Minority Game, entropy, physical complexity, mutual information function. 



1 Introduction. 

In many natural and social systems agents establish among them self a complex network of 
interactions. Often in such systems it is the case that successful agents are those which act in 
ways that are distinct from their competitors. El Farol bar problem [1] has become a 
paradigm of such kind of models. The next step in this endeavor is the well known Minority 
Game model [2], [3]. Numerical simulations by several authors [2], [4-8] have shown that 
minority game (MG) displays a remarkably rich emergent collective behavior. 
The setup of the minority game is the following: The N agents have to choose at each time 
step whether to go in room -1 or 1. The agents who have chosen the less crowded room 
(minority room) win, and the other loose. The agents have limited capabilities, and only 
"remember" the last m outcomes of the game. The number m is called memory size or 
brain size. In order to decide in what room to go agents use strategies. It is the way 
inductive reasonning is made. A strategy is a choosing device, that is, an object that process 
the outcomes of the winning room in the last m time step and accordingly to this 
information prescribes in what room to go the next one. 

The agents randomly pick s strategies at the beginning of the game. After each turn, the 
agents assign one (virtual) point to each of his strategies which would have predicted the 
correct outcome. At each turn of the game, they use whichever is the most successful 
strategy among the s in his possession, i.e., he choose the one that has gained most virtual 
points. Variations of this setup are studied in the remarkable paper [9]. 
In order to understand the complex behavior of the model, particular emphasis have been 
devoted to study the mean square deviation of the number of agents making a given choice 
o . In the financial context, this observable is called volatility. Although the great amount of 



papers devoted to study this quantity (see [4] and [5] for example), A. Cavagna [6] proved 
that the behavior of the above mentioned measure does not depend of the real history of the 
game. Hence, we proposed [10] an alternative new approach for the study of the complex 
behavior of minority game borrowing tools from thermodynamics and statistical physics 
[10]. 

The above mentioned models capture some "stylized facts" of real financial markets, 
because due to formidable complexity of their behavior, deductive rationality is unrealistic in 
such strategic situations. It is generally accepted that agents follow more likely strategies 
which have been more successful in the past. The well know "law of efect" [11] reflect this 
kind of behavior. 

In the original minority game agents make inductive reasoning but only reward each strategy 
in an imperfect way. They do not take into account the impact of their choice in the 
macroscopical observable. In that sense agents are naive. In the paper [9] another kind of 
behaviors are studied which yield more sophisticated conduct of agents. It depend on the 
way agents reward all their strategies and how they estimate their impact on the market 
index. 

The aim of this paper is to study how physical complexity and mutual information function 
of the outcomes of the game change, when the conduct of agents vary from naive to more 
sophisticated behavior. A measure of this change cp is defined and we establish an ansatz for 

the physical complexity C(l) = / a(m s <! ' ) . We make extensive computer simulations to show 
that a(m,s,cp) = a(m, s)q> + b(m, s) and calculate approximated expression for a(m,s) and 
b(m, s) . We also discuss the implication of that behavior of C(l) in the form of the mutual 
information function. 



The paper is structured as follows. In Sec. 2 we introduce our measure of the sophistication 
of the conduct of the agents and discuss the relationship with similar result obtained in [9]. 
We also breifly review the result defined in [10] and establish an ansatz for physical 
complexity. We discuss how the form of the exponent is related with the efficiency of the 
market and with the memory size and the number of strategies used by agents. In Sec. 3 we 
study the implication of the above result on the behavior of mutual information function of 
the string of outcomes of the model. Section 4 is for conclusions and in Sec. 5 one could 
find the references. 

2 The Behavior of Physical Complexity. 

We first breifly review some concepts. Physical complexity (first studied in [12-14]) is 
defined as the number of binary digits that are meaningful in a string 77 with respect to some 
enviroment £ . As in [10] we take in the capacity of £ , the stream of data composed by the 
outcomes of the model and study the physical complexity of substrings 77 of £ ,. In terms of 
Kolmogorov-Chaitin entropy, physical complexity can be written as: 



where ^(77 I £) [14] is the length of the smallest program that computes 77 in a Turing 
machine having £ as input: 



Notice that \r]\ also represent (see [10] for details) the unconditional complexity of string 77 
i.e., the value of complexity if the input would be e = . Of course, the measure K(r\ : £) 
as defined in Eq. 3 has few practical application, mainly because it is impossible to know the 
way in which information about £ is coded in 77 . However (as proved in [14]), if a 



K(rj : £) = 77 - K(7] I £) 



(1) 




(2) 



statistical ensemble of strings is available to us, then the determination of complexity 
becomes an exercise in information theory. It can be proved that the average values C(\j]\) 

of the physical complexity K(j]:e) taken over an ensemble E of strings of length \rj\ can be 
approximated by: 

C(\rii) = (K(Ti:e)) z ^-K(I./e) (3) 

where: 

K(L /£) = -£ p(ri I e) \og 2 p(ri I e) (4) 

and the sum is taking over all the strings 77 in the ensemble E . In a population of N strings 

72(77) 

in the enviroment £ , the quantity — , where n(s) denotes the number of strings equal to 
77 in E , approximates p(r\ I e) as N — > °° . 

Let £ = a l a 2 a 3 ---a n --- ; a, e{-l,l} be the stream of outcomes of the game and / a 
positive integer l>2. Let E, the ensemble of sequences of length / built up by a moving 
windows of length / i.e., if 77 e E ; then 77 = afl^'-a^^ for some value of i. We 
calculate the values of C(l) using this kind of ensemble E ; . As we will show by our 
simulations C(l) scale as: 

C(l) = r "^' (5) 
where m is the memory size of the agents, s is the number of strategies each of them bear 
and cp (to be defined later) is a measure of how sophisticated is the behavior of the agents. 
We borrow some definitions of [9]. Let U si (t) be the score that agent i assigns to strategy 
s in the iteration t of the game. These scores are updated in the additive way: 



U SJ (t + l) = U SJ (t) + A^p(t,s i ,s_ i ) (6) 
where the term (t,s, -,s_, •) quantifies the perceived succes of the strategy s if agent i 
played s t and the "aggregate" choice of the other agents were s_ t . The value fi(t) represent 
the state of the system at time t . 

In the Minority Game the agents are naive. They only know the payoff delivered by the 
strategy s t (t) which they actually played. Hence, naive agents neglect their impact on the 

macroscopical observable A M (t) . To be precise: 

a m A m (t) 

(f , (0, (0) = - (7) 

where a£-° is what would have played agent i if she would have selected strategy s and 
the state of the system would have been the real one, labeled here by fi(t) . 
In [9] the following correction to Eq. (7) is proposed: 

(t, Si (t), (0) = - + cp (8) 

where the added term models the tendency of the agents to stick to the strategy they are 
currently using. If <p = then we have Eq. (7). 

Notice that agents using Eq. (7) behave as "price takers" [9]. They do not take into account 
their influence in the macroscopical observable A M . Agents using Eq. (8) repair in some 
sense this deficiency. Hence, a question naturally arise: Why don't reward to all strategies 
which give as outcome the same result as that actually played? More than that, why don't 
reward to all strategies with a quantity proportional to the payoff the agent would have 
obtained if she would have played it? In fact, several of the strategies of one agent could 



give the same answer as that actually played. Beside, the only way in which agents "impact" 
the market in these models is by their contribution to macro scopical observable A M . No 
rumors, no news spread among them. Therefore, we propose a more realistic way of reward 
strategies: 

U MJ (f + D = U„ (f ) ^ — '- (9) 

That is, reward every strategy with what it would have won (lost) if the agent would have 
played it. Agents following Eq. (9) are able of real complex counter-factual thinking and we 
called completely sophisticated agents. As stated in [9] concept sophisticated as oposed to 
naive is borrow from [15]. 
Notice that Eq. (9) can be written as: 

U, t (t + 1) = U ti it) - " - + ? ^ (10) 

Therefore, the correction to naive behavior to become completely sophisticated is the term 
mm _i 

— - — . We finally propose the following procedure to reward strategies: 

U„ (t + l) = U, t it) - " - + cp ^ (11) 

If <p = the agents rewarding procedure is naive. If (p = 1 the agents rewarding procedure 
is completely sophisticated. When (p moves from to 1 Eq. (11) goes through all rewarding 
procedures from naive to completely sophisticated. 

We make extensive computer simulations for several values of m, s and (p . For every 
m = 2,. ..,5, s = 2,. ..,5 and (p e {o,0.1,...,l} we repeat ten times the computer experiment 



and for each of them construct the correspongding physical complexity function C(l) . They 
fit very well to a power function for every values of m, s and cp . The exponent a is always 
positive. In Fig. 1 can be seen the plot of C(l) v.s. / for m = 4, s = 3, (p = 0.4 : 
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Fig. 1: Values of physical complexity v.s. the length of string of set . The continuos line 
represent the fitness to a power function. 

Later we averaged the values of exponents for each triplet (m, s, (p) conforming a 3D array 
with entries m,s and (p. With these data we performed a regression of exponent a with 
respect to the variables m , s and (p of the form: 

a(m,s,(p) = a(m,s)(p + b(m.s) (13) 

where: 

a(m,s) = (a,m 2 + a 2 m + a 3 )s 2 +(a 4 m 2 + a 5 m + a 6 )s + a 7 m 2 +a g m + a 9 



b(m, s) = (£,m 2 + b 2 m + b } )s 2 + (b 4 m 2 + b 5 m + b 6 )s + b 7 m 2 + b % m + b 9 
The choice of this ansatz is based in our observations about the behavior of a with respect 
to m and s for every fixed <p . As we show later it is fully justified. The values of 
coeficients appear in the following table: 

Table I 

Regresion coeficients in Eq. (13) 

fll = 0.24450 a 2 = -2.07810 a, = 4.86120 
a 4 = -1.4423 a 5 = 1.581800 a b = 3.74100 
a 7 = 1.86670 a, = -15.2176 a 9 = 34.5322 

b x = -0.1696 b 2 = 1.5818 b 3 = -3.7410 
b 4 = 2.1426 b 5 = -19.4888 b b = 44.5105 
b 7 = -1.9711 b % = 17.1166 b 9 = -34.1939 

In the Fig. 2 we show a representation of function a(m,s). The plot has been done 

continuously to enhance the behavior of a(m,s) . Notice that for every pair (m,s) the value 

of a(m, s) is negative. In our opinion it is related with a kind of efficient market hypothesis 

for this models . The efficient market hypothesis [16] establish that the market is highly 

efficient in the determination of the most rational prices. All available information is instantly 

processed when it reachs the market and it is immediately reflected in a new values of prices 

of assets traded. In our models, as cp increases the agents reward more suitably each of their 

strategies. The value cp = 1 represent the perfect scenario in which 



Fig. 2: The plot of the function a{m,S) . Notice that for every values of Wl and S the function is negative. The plot 
has been done continuously to enhance the behavior of the function. 

each agent not only knows what he won (lost), but also what he would have won (lost) if he 
would have used the remainder strategies. Hence, as <p increase the string £ become more 



unpredictable and function C(Z) more flat. As we showed in [10] random sequences have 
flat function C(Z) . 

Therefore, the function a(m,s) can be interpreted as the rate of increase of randomness with 
respect to the increase of the "knowledge" of the agents (p . In the next section we study 
how changes in (p affect the time correlation of macroscopical observable. 
3 The behavior of mutual information function. 

In the last section we show how the increase of <p increases the randomness of the series of 
strings of the outcomes of the model . However, nothing has been said about the correlation 
of the outcomes along the time. Because the distance between two binary symbols in £ 
represent the number of time iterations between them, a measure of the degree of correlation 
between elements in the symbolic string yield information about time correlation of the 
outcomes of winner room. 
The mutual information function is defined as: 

M(d) = Y,P aP (d)\og 2 

where: P a p(d) is the probability of having a symbol a followed d sites away by a symbol 
jS and P a the density of the symbol a . It can be proved (see [10] an references there in) 

that mutual information function is a very sensitive measure of correlation. 
The most important feature of mutual information function of the string £ is his remarkable 
persistence of high correlation at some large distances as well as sudden "falls" also at some 
large distance. For cp small the lack of correlation accumulates yielding profound and 
periodic "falls" as can be seen in Fig. 3. In a market with these features the possibility of 



P a Pp 



(6) 
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Fig. 3: Mutual information function for the values m = 3, s = 2, (p = • Notice the periodic behavior of the 
function. 

speculation would exist if some agents could anticipe these "falls". We claim that this fact 
can be explained by the anomalous diffusion of the variable fl(t) [17] which is close related 
with the behavior of some cellular automata rules [18, 19]. We acknowledge the primacy of 
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Fig. 4: The mutual information function for several values of (p and m , s fixed. Form top left to bottom right: 
(p = 0.2, <p = 0.4, (p = 0.6, (p = 0.8 • The other values are: m = 4, s = 3 

D. Chalet, M. Marsili and R. Zecchina [9] in the understanding of conection between 
behavior of /n(t) and Bruijn diagrams. As q> increase two process develop due to the 
improve on the information level of the agents, as can be seen in Fig. 4: the profound "falls" 
become more sparsed. Speculation with great profit is less possible. Small "falls" appear 
more often. All of them are more smart and the margin of profit is smaller. When<p — > 1 
almost no great "falls" appear and the series is very noisy. 
4 Conclusions. 

We have developed a more rational way of reward the different strategies of the agents. The 
parameter q> is a measure of how good the agents understand the game and hence reward 



their strategies. We have shown that as q> increase the number of binary digits which are 



predictable in reference to the whole series £ decreases for every length / of the string 
considered. It implies that the whole series £ become more unpredictable as we showed in 
[10]. This behavior is reflected in the mutual information function of £ . The great "falls" 
appear less often, but small oscilations become more frecuently. It is close related with the 
increase of coordination of the agents. The ansatz proposed allow to obtain a measure of the 
loss of predictability as <p increase: the coefficient a(m,s) . As we have shown, it is always 
negative. This can be interpreted as a kind of efficient market hypothesis because as far as 
the agents better incorporate the information about the outcomes the curve of C(l) become 
more flat in agreement with a more random behavior. 
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